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Abstract 

Consider Ginibre's ensemble of N x N non-Hermitian random matrices in which all 
entries are independent complex Gaussians of mean zero and variance -k . As N | oo the 
normalized counting measure of the eigenvalues converges to the uniform measure on the 
unit disk in the complex plane. In this note we describe fluctuations about this Circular 
Law. First we obtain finite N formulas for the covariance of certain linear statistics of the 
eigenvalues. Asymptotics of these objects coupled with a theorem of Costin and Lebowitz 
then result in central limit theorems for a variety of these statistics. 

1 Introduction 

A fundamental non-Hermitian ensemble of Random Matrix Theory (RMT) is that of N x N 
matrices with independent complex Gaussian entries of mean zero and variance jf. This model 
is typically attributed to Ginibre who derived ([14]) the joint distribution of eigenvalues {z^} 
lying in the complex plane C. That object is given by 

dP N (z l} Z2, ■ ■ ■ ,z N ) = — TT \z£ - z k \ 2 fi N (dzi) ■ ■ ■ fi N (dz N ) (1.1) 

Z/v 
IV l<e<k<N 

in which /j,j^(dz) = e~ N \ z \ 2 d^St(z)dS^(z) and is the appropriate normalizer. A straightforward 
Large Deviation analysis will take you from (1.1) to the Circular Law: for N f oo the measure 
wY2k=l^ z k tends weakly to the uniform measure on the disk \z\ < l. 1 In other words, the 
mean number of eigenvalues falling in some subset of the disk is well approximated by A^x the 
normalized area. The goal here is to describe the associated fluctuations. 
Our study is centered around linear eigenvalue statistics of the form 

N 

*(/) = £/(**)> 

k=l 

and the behavior of their covariances 

Cov N (x(f),X(g)) = f X(f)X(g)dP N - [ X(f)dP N [ X(g)dP N 

V ' j£N J C N J C N 
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1 Experts know this to hold under far more general conditions on the underlying distribution, see [1] for the 
best result. 
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as N | oo. In particular, if the statistic is one of either the moduli, = f(\zk\), or the 

angles, f{zk) = /(arg(zfc)), we obtain exact formulas for the covariance at finite N which 
are amenable to a precise asymptotic analysis. Afterward, noting that the Vandermonde 
component in (1.1) identifies the present ensemble as an determinantal point field 2 , we may 
apply a basic result of that theory due to Costin and Lebowitz along with our asymptotics to 
conclude a central limit theorem for X{f) in several cases. 

There is already a vast body of work on fluctuations of linear statistics in random matrix 
ensembles. The majority of these concern either Hermitian ensembles (for example, [2], [3], 
[4], [7], [15], [16], [18] [27], and [29]) or the ensembles defined by Haar measure on the classical 
compact groups ([9], [20], [19], [28], and [32] to name a few). Of course, works such as [30] which 
focus on determinantal point fields take on a more general point of view. Nevertheless, in the 
non-Hermitian setting it appears this type of fluctuation question is only directly considered 
in [12], and that more or less from a physics standpoint. In fact, one motivation for the 
present paper was the increased interest in the physics community in non-Hermitian ensembles 
([12] and [13] offer guides to that literature). A second motivation lies in the discovery that 
Ginibre's complex Gaussian ensemble possesses a structure allowing for exact formulas that in 
turn may be studied via rather straightforward mathematical machinery (basically Laplace- 
type asymptotics). Finally, it is expected that the results here should provide some indication 
as to the order and shape of fluctuations in more general non-Hermitian ensembles. 3 

We begin with a description of our covariance formulas. The case when / and g are 
functions of the spectral moduli alone is far the simpler of the two. The observation is that: 

Theorem 1.1 For statistics of the moduli, let f and g be bounded over W + then 

N 



Cov N (x(f),X( g ))=Y,{E[f(^s e ) 9 (^s e )} -E[f(^8 t )]E[g(J±s e )]} (1.2) 

l=\ 

where S£ = rji + • ■ ■ + % for {%} a sequence of independent exponential random variables of 
mean one. 



The angular statistics possess a more interesting structure, requiring the setting down of 
some notation. The angles or arguments of the {zk} are indexed to lie in the circle T = [—it, it], 
the latter normalized throughout according to J T dd = 1. We recall the Dirichlet kernel 

k=-t sin [2 9 ) 

which acts on functions / of T by convolution: (Dp * f)(9) = / T Di(0 — 0')f{6') d0' . As is well 
known, this action produces an approximation of the identity, that is, lim^oo (i^fc * f){6) = f(6) 

2 Also known as fermionic fields, these objects were introduced by Macchi in [23]; an excellent introduction 
to their applications in RMT and beyond is contained in [31]. 

3 Part of the sequel carries over to Ginibre's Quaternion Gaussian ensemble, see Section 5. However, describing 
fluctuations in the real Gaussian ensemble will require a new approach due to the intricacies of that eigenvalue 
density, see [21] or [11]. 
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at points of continuity of /. Figuring more prominently in what follows is a different kernel 
with that property. We define 

Ceie) = cos2 ^ } + ^ ilrfyf cos2£+1{e) (L3) 

in which T is the usual Gamma function. The result is: 

Theorem 1.2 For statistics of the spectral angles, let f,g G L 2 (T). We then have 

N-l 

Cov N (x(f),X(gj) = iv((/*5)(0)--^(Q*/*5)(0)) (1.4) 

e=o 

N-l 

~ (S Ce ~ D 2N-2i-2 * Cl) *(f* 3)) (0) 

in which g{9) = g(—9). 

Theorem 1.2 should be compared with the results of [6] which discusses the allied question 
for Haar distributed eigenvalues in the Unitary group U(N). As in [6], it is striking that 
the covariance depends on the test functions / and g only through / * g. Further, with the 
Cesaro averages ^2^=0^ Ci{6) also forming approximation to the identity, the content of (1.4) 
is that the large N properties of the covariance are tied to the error in that approximation. 
In [6] fluctuations for U(N) are shown to be related in the same way to the error in Fejer 
approximation . 

Next we turn to the asymptotics. It is of interest to connect the growth of the covariance 
in N to the smoothness imposed on the underlying test functions / and g. For example, the 

0(1) fluctuations of X(f) for / a bit better than twice differentiable plus a growth condition 

i. 

in the Hermitian case (see [18]) and for / G H£ (T) in U(N) (see [9] or [19]) are well known 
manifestations of the rigidity of random matrix ensembles. In line with those results, an 
immediate consequence of Theorem 1.1 is the following. 

Theorem 1.3 Let f(z) = f(\z\) and g(z) = g(\z\) lie in C 2 ' 5 for r = \z\ G (0,1 + e) with 
some 5 > and e > and otherwise be bounded. Then, as N f oo, Covjsf(X( f), X(g)) = 
\ Jo f i r )g' i r ) r d r + Moreover, the centered (but unnormalized) random variable X(f) — 
E[X(f)] converges to a mean zero Gaussian with the corresponding variance. 

Actually, the central limit theorem for X(f(\z\)) described in Theorem 1.3 has already been 
discussed in [12]. We in fact follow the ideas therein; a proof is included here for the sake of 
completeness and rigor. While [12] employs the same product structure behind Theorem 1.1, 
the probabilistic interpretation which guides our results and allows for more to be accomplished 
(see below) was apparently not noticed. Also in [12], the author conjectures (and provides 
heuristics based on the associated log-gas) that the general (smooth) linear statistic should 
have order one Gaussian fluctuations in the large iV limit. Our next result shows that this is 
not the case. 
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Theorem 1.4 Let the function f of the phase have Fourier coefficients f(k) = J T e lke f(9)d6 
satisfying ^^°oo ^ 2 1/(^)1 2 < 00 anc ^ likewise for g. Then 

Cov N (x(f),X(gj) = 1 ^- J f'(9)gf(9)dB + oQogN). 



\ f{k)\ 2 < oo for some 5 > 0, the o(logiV) error may 6e replaced by an O(l). 



// a 6i£ more smoothness on either test function is assumed, in particular if Yl^oo 1^1 

|2 



Of course, an O(logiV) fluctuation is rigid compared to the O(N) characterizing an ensemble 
of independent particles. Still, this marked difference between radial and angular statistics was 
not anticipated. 

Changing focus, an important class of non-smooth test functions to consider are indicators 
of given sets within the spectrum, thus providing an eigenvalue count. Setting this case of 
the number statistic apart, let us denote #0[a,(3] = -X"(l{arg(z fc )e[a,/3]})- and #I[a,b] = 
X(l{| Zfe | e [ a fe ]}) The next result concerns the growth of the variances of #@[a,(3] and #X[a, b], 
with added attention to the mesoscopic scales, that is, when \a — (3\ (or \a — b\) 10 as N f oo. 
It is found there is a break in the behavior depending on whether y/N\a — (3\ (or \f~N\a — b\) 
tends to oo or not. 

For the angular number statistic the result is: 

Theorem 1.5 Note first by rotation invariance it is enough to consider the symmetric interval 
[—a/2, a/2]. If a > remains fixed while N j oo, then 

Var^(#6[-|,|]) =^N-±^ + 0(logN). (1.5) 

// instead a = a(N) J, 0, Na(N) f oo, there are the following cases: 

{ V^y^(l + o(l)) if lim^ Too ^/Na = oo 

Var w (#G[-|, |]) = I ^N n -^I arg ((3) + O(logiV) if \im moo VNa = (3 > (1.6) 

[ JVa(l + o(l)) t/ lim^oo VJVa = 0, 

where, I arg {(3) > and is such that UmpioI arg ((3) = while lim^oo I ar g(P) = 1/ its definition 
may be found below in (3.11). 

The radial case has a related structure: 

Theorem 1.6 For fixed [a,b] € (0,1) we have that, 

V aiN (#I[a,b])=VN(^)+0(l). (1.7) 



If rather (b — a) j while N(b — a) f oo, i/ien 
Var 7V (#T[a,5]) 



^^(1 + 0(1)) */ limivtoo VJV(6 - a) = oo 

VN^Imod(c) + O(l) */ limjv Too >/]V(6 - a) = c> (1.8) 
k iV(6 2 - a 2 )(l + o(l)) */ limjv Too y/N(b - a) = 0, 



m which, similar to the above, I mo d (defined in (4-13)) is non-negative and satisfies lim^o I mo d(c) 
0, lim cToo I mod (c) = 1. 
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Remark Note that in the radial case the variance asymptotics are modulated by the limiting 
radial distribution, the corresponding angular distribution being uniform. 



We may now invoke the Costin-Lebowitz Theorem (see Section 5). First proved in [7] 
for the sine kernel and extended by A. Soshnikov in [29] and [30], in the present setting the 
theorem implies that: with Na f oo (respectively N(b — a) f oo) 

#e[-a, a] - Ew [#e[- a , Q ]] L r ..*IKH-%W..HA 9) 

tends to a Gaussian random variable of mean zero and variance one. Since we may also compute 
En[^Q[— a, a]] = N{2a) and E^[j^I[a, b]] = N(b — a) + 0(1), an interesting observation 
regarding the mesoscopic scales is that when limjvfoo y/N a < oo (or limjvtoo (b— a) < oo) we 
have that Varjv[#G[-a, a]} = 0(E N [#Q[-a,a]]) (or Yav N [#l[a, b]} = 0(E N [#l[a,b]])) as in 
the case of independent particles. Contrariwise, if lim7V|oo V~N& = °o (and limNtoo y/N(b—a) = 
oo) the slow growth of the variance compared to the mean indicates rigidity. These remarks 
are suggestive of a possible asymptotic independence taking place on the small scales. In this 
context we add that if one considers the outlying or edge eigenvalues (which may be considered 
a small scale) it is found that Varjv[^£X[l, oo]] = 0(En[#Z[1, oo]]) = 0(\/N). Further evidence 
of an asymptotic independence of the edge is contained in [25] and [26] which show the largest 
eigenvalues in absolute value respond to a limit theorem shared by independent sequences (see 
also Section 5). 

Finally, consider the limiting Gaussian field Qoo[a,/3] on T (or Ioo[a, b] on [0, 1]) resulting 
from the properly normalized statistics (1.9) for fixed arguments (a, (3) (or (a, b)) as N j oo. 
The corresponding correlations are anticipated to be such that, taking the first case, Qoo[a,(3] 
and Goo [S, 7] are independent if either [a, (3] and [S,j] are disjoint or one is properly contained 
in the other; those intervals can positively or negatively correlated when they share a single 
endpoint. This rather odd structure was first observed by Wieand (see [32]) for U(N) and has 
since been rediscovered in this and other ensembles by a variety of methods. If we wanted only 
to obtain this limit, an extension of Costin-Lebowitz could be employed (see [30]). However, 
another advantage of the formulas obtained in Theorems 1.1 and 1.2 is that we can directly 
describe the correlations at all large but finite values of N. Our last result is then as follows: 
note the shape of the second order terms which speak to the differing degree of rigidity in the 
two cases, 

Theorem 1.7 The covariance of#Q[a,[3] and #6[<5, 7] satisfies 

( ±0(^) if [a, (3} C [5, 7] or [a, (3} n [5, 7] = 0, 



Covat 



#Q[a,/3] #Q[5, 7 ] 



(jVvr3)l/4' (^3)1/4 



+ ifa = 5{(3^ 1 )orf3 = 1 {a^5), 

+ if(3 = 6(a^ 1 )ora = 1 ((3^S). 



For #X[a, b] and #T[c, d] the result is 

( ±0(e^ N ) if[a,b] C [c,d] or[a,b}n[c,d] = 0, 

xy } v > 7 y -| + 0(-J=) ifb = c{a^d)ora = d(b^c) 
where c\ > and depends on a, b, c, d. 
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As for the remainder of this note: in the next section we derive our basic formulas for the 
covariance (Theorems 1.1 and 1.2), the verification of the asymptotic statements are found in 
Section 3 (angular case) and Section 4 (radial case). Section 5 extends the present results in 
the radial setting to Ginibre's Gaussian quaternion ensemble. For the reader's convenience 
Section 6 serves as a brief appendix on the Costin-Lebowitz theorem. 

Remark In a recent extension of Costin-Lebowitz, [29] shows that (modulo technicalities) 
linear statistics in general determinantal point fields satisfy a central limit theorem as long 
as the variance grows faster than a small positive power of the expectation. The estimates 
below then imply a central limit theorem at rate TV 1 / 4 for X(f) (either in the angular or radial 
case) when / has anywhere a jump discontinuity. Unfortunately, the 0(log TV) growth of the 
variance in Theorem 1.4 is not fast enough from this point of view. A proof of the central limit 
theorem in the case of smooth angular statistics will appear elsewhere. 



2 Covariance Formulas 



Appropriate row and column operations in (1.1) allow the eigenvalue density to be re-expressed 
as in 



dP N (zi, z 2 ,... z N ) = — det 



K N (z e ,z k ) 



l<£,k<N 



fj,N(dzi) ■ ■■ fi N (dz N ) 



(2.1) 



with the Hermitian kernel 



N-l 



K N (z,w) = ^2 



N^ 1 



-z w , 



see [24]. As an operator on L 2 (C, fiN(dz)), Kn(z,w) projects onto the span of the first TV 
polynomials orthogonal with respect to that weight (those being just the monomials l,z, z 2 
etc.). This fact explains the rules f c K^(z, z)fj,N(dz) = TV and j c K^{z,v)K^{v,w)iiN{dv) = 
Kn(z, w), from which it follows that the marginal densities of Pn are given by: 



P N ( Zl ,...,z k ) 



(2.2) 



1 



L j£N-k Jy I 

(TV - A;)! 



TV! 



det 



K N (zi,Zj) 
K N (zi,Zj) 



fi N (dz k+1 ) ■ ■ ■ fi N (dz N )^n N (dz 1 ) ■ ■ ■ HN{dz k ) 



l<i,j<N 

HN(dzi) ■ ■■jj, N {dz k ). 



l<i,j<k 



(normalized differently these objects are also known as the correlation functions). Applying 
the integrating-out rules behind (2.2) along with the symmetries of the integrands below we 
find 



En 



X(f) = N / f( Zl )dP N 

c N 



f(z)K N (z,z)n N {dz) 



and 



Cov N (x{f),X{g)) 



f(z)g(z)K N (z, z)n N {dz) 



(2.3) 



(2.4) 



// 



f(z)g{w)K N (z,w)K N (w,z)n N {dz)n N (dw). 
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With that our expressions for the covariance may be established. 

Proof of Theorem 1.1 In polar coordinates, z = (r,a),w = (s,f3), you find that 



N-l 



p J J _ o poo 

/ f(\z\)g(\z\)K N (z,z) m (dz) = N e+1 - / f{r)g{r)r 2l+1 e- N ^ dr, 
J c V- Jo 

while in the second integral of (2.4) only the diagonal term survives: 

/ / f(\z\)g(\w\)\K N (z,iu)\ 2 n N (dz)fi N (dw) 
Jc Jc 

= I f{r) 9 {s)r^s^e-^e-^ 2 drds_. 



The connection with (1.2) is then made after a change of variables: the distribution p(dr) := 
(N i+1 /£\)r e e~ Nr dr being recognized as that for a sum of (£ + 1) independent exponential 
random variables, each of mean The proof is finished. 

Proof of Theorem 1.2 From here on we denote 4>(9) = (f * g){9). Also, in the present 
derivation we consider N to be even just to have things set; the formulas are easily adjusted 
for odd values of N . 

To arrive at (1.4), it is now the radial component which is integrated out. Doing so in the 
first term of (2.4) is immediate: 

/ (fg)( a rg(z))K N (z,z)fi N (dz)= [ f(9)g(9)d9 f° K N (r,r)re- Nr ' 2 dr = N<f>(0). 
Jc Jt Jo 

While for the second term we have 

/ / f(^g(z))g(&rg(w))\K N (z,w)\ 2 fi N (dz)[i N (dw) 
Jc Jc 

f f r ( f°° r k+e+1 p- Nr2 d r) 2 i 

= / / f(a)g((3)\ y e ^k-D e -i0(k-i) Uo e ^N k+e+2 ]dadl3. 

hh l <ki<N-i km J 

With <p{k) = fj e lk9 (f)(9)d9 = f(k)g(—k) the k-th Fourier coefficient of <f>, this produces 



Cov N (x{f),X(gj) = N<K0)~ E [ T (^ + l )] 2 %F 1 M 

0<k,e<N-l 

:= N</>(0)-S° N (<t>). 

as a preliminary form of our covariance formula. The obvious next step is make the substitution 
(k + £, k — £) = (2n, 2m) for k and £ of the same parity and (k + £, k — £) = (2n + 1, 2m — 1) 
otherwise. The result of that move is 

s o U) __ y> (n\) 2 j(2m) ^ (T(n + §) ) 2 0(2m - 1) 

N{ ) (n-m)\(n + m)\ ^ (n + m)\(n - m + 1)! 
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in which the sets T>n and T>' N in the (m, n)-plane are described as follows. T>n consists of two 
triangles: the lower defined by the points (0, 0), (N/2 - 1, N/2 - 1) and (-N/2 + 1, N/2 - 1), 
and the upper defined by (0,7V - 1), (N/2 - 1, N/2) and (-N/2 + I, N/2). The set V' N = 
T>n U 2?^ where the latter consists of all integer points lying on the lines between (1,0) and 
(N/2 + 1, N/2 - 1) and between (N/2 + 1, TV/2) and (1, N - 1). 

To evaluate (2.6), first consider the sum over the lower half of T>n- In particular start 
with a fixed n < N/2 — 1 so that the corresponding sum over m ranges fully between — n 
and +n. This object may be computed by the following observation. Denote by £i, (2, ■ ■ ■ a 
sequence of independent ±1 Bernoulli random variables with mean E, and notice that: with 



E 



(n!) 2 £(2m) 



(n — m)\(n + m)\ 
= c(n)sfei + --- + C2n 



2n 



c(n) / (£ 
/t 



-1C16 



c(n) £ ^(2m)^- m j2-*» (2.7) 

)%(#)d# = c(n) / 0(0) cos 2n (#) d0. 



In the same way we have 

"+1 , o /<i\\2 



£ (r(n + 3/2))^(2 m+ l) _ (r(n + 8/2))' y 
4^ (n-m)!(n + m + l)! (2n+l)! A 



for the typical term in lower half of the V' N sum. Pairing the final expressions in (2.7) and 
(2.8) in n brings out the sum of (C n * 4>)(0) for n from to N/2 — 1. 

The stated form of the covariance is then obtained by extending the summation over m 
in the upper reaches of the n variables (extending to a "big triangle"), in order to produce a 
summation of C n over all n < N — 1. That is, we write 

S%(4>) := S N (c/>) + S' N (4>) 

with Sjsr((f>) = ~^2m=o(C n * </>)(0) and S' N ((f>) the error involved in throwing too much into 
the mix. To complete the picture, we spell out the contribution to S' N ((p) connected to the 
equal-parity, or T>n, sum. That reads: with P2n(m) = ( Tt +^)2~ 2n and c(n) as before, 

JV-1 n 



^2 c(n) ^ <K 2m ) 1 ~ 1 {\m\<N-l-n} P2n(m) 



JV-1 



= ^2 c ( n ) [(^~ ( D 2N-2n-2 *</>)) (2m) P2n(m) 

n= HL m=—n 



2 

->2n ' Zn 



J2 22n (Z) l T cos2n{6] o{<)) - {D 2S --i l ,-2^m 



d0. 



The sum corresponding to the V' N part of our expression is similar; that its outcome is as 
advertised should be clear. The proof is finished. 
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3 Angular Statistics 



That the kernels Ce form an approximation of the identity is seen from: (1) the evaluation 
J T C e (9)de = 1 (based on J T cos 2e (9) d9 = 2~ 2t (™) and J T cos 2e+1 (9) d9 = 0), and (2), the 
basic estimate 

J £ C e (e)d6 = ][^f e- ee2 d6 + o(l) = 1 + o(l) (3.1) 

valid for any small e > 0. The proofs of the main theorems hinge on the justification of exactly 
this sort of standard Laplace asymptotics, though it is necessary to extract the precise form of 
the error terms. For the applications in mind, Ci acts on one of two classes of test functions. 
We restrict our attention to those cases: 

Lemma 3.1 Let h{9) be twice continuously differentiable on T. Then, as £ \ oo, 

(Q * h)(o) = jf h(0)c e (0) do = mo) + ^ + v -^y l + o(-gp) ■ (3.2) 

In general, v(£,h) = o(l) for large values of £. This may be improved to v(£,h) = 0(£~ s / 2 ) if 
h"(9) happens to be Holder continuous of order 5 > in a neighborhood of the origin. 
Let instead h(0) be a continuous, piecewise-linear function of 9 £ T. In that case, 

is the appropriate estimate for I j oo. Here h' + and h'_ indicate right and left derivatives. 

A comment is perhaps in order at this point as to the comparison between the kernel arising 
and being analyzed here and the better known Dirichlet kernel. The content on the second 
equality in (3.1) is that Ce(9) concentrates its mass in a neig hborhood of width 0{r 1 / 2 ). 
The concentration of the Dirichlet is sharper, taking place in a neighborhood of order l^ 1 . 
Returning to the discussion after Theorem 1.2 this explains why the fluctuations for the angles 
in Ginibre's ensemble may be expected to be larger than those for U(N). 

Proof of Lemma 3.1 For h in either class, it may be assumed from the start that h(0) = 
(Jt d9 = 1). Also, throughout the proof we will make use of the simple estimates 

which follow from Stirling's approximation in the form = e~^v / 27r(l + yr^~ 1 +0(£~ 2 )). 
Making the abbreviation 



Ct(e) = [2 2 ^]cos^ W+ [2 2 ^^±|jcos-- W 

:= a{i) cos 2e (9) + b(£) cos 2m (#), 

the rule r(fc) = (k - l)F(k - 1) provides the identity b(£) = a{£) x (£ + i) x (F(£ + ±)/£\) 2 , 
and from (3.4) we note that a(£) ~ b(£) ~ \Tk£. The basic behavior of Q for £ \ oo may then 
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be described as a sum of two (approximate) point masses of weight ~ 1/2 at 9 = and the 
difference of two such masses at the common point 9 = ±7r. Our first step is to dispense of 
the cancellation which must take place at the latter point. 

Consider in particular the integral J^ 2 h(0)Cg(8) d0, with that over 9 G [— ir, — tt/2] being 
treated similarly. Denoting g{9) = h{9) — h(ir) we write 

f'TX f'TV f'TV 

/ Cn(9)h(9) d9 = h(ir) / C e (9)d0 + (a(£) - b(£)) / cos n (9)g(9) d9 (3.5) 

Jn/2 Jn/2 Jn/2 



PIT 

+b(£) / ■ 

Jn/2 



cos 2e (9)g(9)(l + cos(9))d9. 



This object is to be dominated by a constant multiple of £ 3 / 2 . The first term on the right 
hand side responds to an exact calculation plus by an appeal to (3.4): 



f C e (0)d9 = a(£) [ 1 cos 21 '{9)d9 - b{£) [ ' cos 2m 
Jtt/2 Jo Jo 



(9)d9 



As for terms two and three, our regularity assumptions on h imply that |g(#)| < c\\tt — 9\ and 
so \g(9)(l + cos(^))| < C2 1 7r — 6* | 3 for 9 G \k — a,ir] with a small a > 0. Also on that same range 
we have the bound cos 2e (9) < e~ C3 ^ e ~^ for C3 = 03(a) > 0, while a fixed distance away from 
9 = 7r (or 9 = 0) cos 2£ (9) is exponentially small. It follows that 



cos 2e {9)g(9)d9 



tt/2 



< C4- and 



cos 2e {9)g{9)(l + cos{9))d9 



tt/2 



< C 5 



£ 2 ' 



Using again (3.4) shows that 

a(£) - b{£) = 2 2e 



2(: 



-1 



+ 



ix/r(^+|)\2 



£\ 



O 



and completes our consideration of (3.5). 

Turning our attention to the contribution to the integral from the vicinity of 9 
first the case that h € C 2 and use Taylor's theorem with remainder to write 



0, take 



h{9) = h'(0)9 + ^h"(0)9 2 + J° (h"{a) - /i"(0)) (9 - a)da, (3.6) 

(recall h(0) = 0). Since Ce(9) is even there is no contribution from the first term on the 
right hand side: f™^ 2 Ce(9)9 d9 = 0. Continuing to the quadratic term, one may again cut 
down the range of integration. Needing to be more precise this time around we restrict to 
\9\ < r 1 / 4 , the ensuing error still exponentially small. Further, on that range it holds that 

|1 - cos(0)| < 9 2 and |1 - e m2 cos 2e (9)\ < U9 4 . Using these estimates in J^ 2 /2 C i {9)9 2 d9 = 
Ci(9)9 2 d9 + 0(e~ C6 ^) and then restoring the limits of integration over the whole line, 
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we have 

r /2 Ce (9)9 2 = ^- r /2 9 2 (i + cos(e))cos 2e (e)de+ a{i) ~ m f /2 e 2 cos 2£+ \e) de 

J-n/2 27r J-n/2 4?r J-n/2 

= yiy°° e 2 e- m2 de + o^j 00 (r^e 2 + ^/ 2 # 4 + £ 3 / 2 e 6 ) e - e02 do) = 1 + o(^). 



There remains the contribution due the last term on the right of (3.6). The kernel Ce is to be 
integrated against 

[ (h"(a) - ti'(0j) (9 - a)da < 9 2 max \h"(9a) - h"(0)\ := 9 2 9(9, h) (3.7) 

Jq V / -1<Q<1 

from — 7r/2 < 9 < ir/2. By continuity, the nonnegative function v satisfies v \ with J. 0. 
Running through the above arguments will explain why the integral in question is bounded by 
a constant multiple of 

VI / 9 2 v(9,h)e~ w2 d9 < - max 9(6, h) := -Mt,h) 

with lim^ioo ^(£, /i) = as promised in (3.2). The comment following that display stems from 
the fact that if h" is Holder continuous with exponent 5 > 0, the right hand side of (3.7) may 
be replaced by a constant times 9 2+s . The ensuing integral in the last display would then 
decay like H 1+5 / 2 ). 

The details behind (3.3) are much the same. One may assume that h(9) = h + 9 + — h-9~ 
in some neighborhood of the origin and the leading order comes out of J °° 9e~ e02 d9 = 1/21. 
The proof is finished. 

Proof of Theorem 1.4 As / and g each have one L 2 derivative, their convolution (f)(9) = 
(/ * 9)(Q) is twice continuously differentiable. Recalling the covariance formula 

Cov N (x(f),X(gj) = N<f>(0) - S N (<f>) - S' N ((f>) 

for 

N-i . 

S N (cl>) = Y, / Ct(e)<f>(6)d0 and S^tf) = Yl C e *{4>- D 2N -2e-2<l>)(0), 

£=0 Jj N/2<£<N 

we are in the setting of the first half of Lemma 3.1. The statement (3.2) implies that 

S N (<j>) = N<f>(0) + 1^(0) £ \ + £ V -^T L + 0(1). (3.8) 

i=\ e=i 

(Here, the 0(1) term includes both the sum of the order t~ z l 2 errors in (3.2) as well as the sum 
of first several terms (Cg * </>)(0) where the estimate is of no affect.) The second term produces 
the logiV figuring into the leading order growth of covariance. Generally i>(£,(ft) = o(l) and 
we can conclude only that the third term YleLi ^j^- = o(logiV). The point of the closing 
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remark in the theorem's statement is that the proposed additional decay on \f(k)\ (or |g(/c)|) 
for k | oo implies Holder continuity of (p" . In particular, for positive S satisfying 5/2 < 1, 

oo oo 

<"(9)-4>"(0)\< £ k 2 \f(k)\\g(k)\\±-e lke \<M9\ S/2 E \k\ 2+S/2 \f(k)\\g(k)\ 

k=—oo k=— oo 

in which the last sum is finite by Schwartz's inequality. By the related remark in the statement 
of Lemma 3.1 it follows that the term ^ 1 v(l 1 4>) is summable in this case. In either case, 
N(p(0) — Sn(,4>) already accounts for the advertised asymptotic behavior of the covariance 
(note that <f>"(0) = - / T f'(9)g'(9)d9). 

The remaining (error) term S' N ((f>) is seen to be of constant order under the present smooth- 
ness assumptions. We write 

S 'n(<P)= £ E C e (k)kk), (3.9) 

N/2<£<N \k\>2N-2£-2 

and introduce the simple facts: = for k > 21 + 1, and, in general, |Q(&0I < 2. It 

follows that 

\s' N m < ci e E (\Hk)\ + \H-k)\) 

0<£<N/2 N-2i<k<N+2l 

00 1/2 

< C2 e \k\(\m\ + \H-k)\) < C3 ( e i^imi 2 ) 

0<k<N fc=-oo 

after changing variables and then the order of summation in inequalities one and two, an 
application of Cauchy-Schwartz for the third, and at last the given fact: Yl l^| 4 |<A(^)| 2 ^ 
(£ |m| 2 |/(m)| 2 ) {J2 \n\ 2 \g{-n)\ 2 ) < oo. The proof is finished. 

Proof of Theorem 1.5 When f(6) = g{6) = l[_ a /2,a/2](0)> the convolution := (f> a {9) is the 
tent function 

and the main contribution to the variance is then 

JV-l ,. a N-l a 

N(j) a (0) - S N (4> a ) = N— - E / + |0|Q(0)d0. (3.10) 

£=0 Q £=0 a 

Let us first consider this object for the various situations, a fixed or a [ as N f oo, and 
return to the error term S' N ((j) a ) at the end. 

From the proof of Lemma 3.1 it follows that for a > fixed ± /* Q C e {9)d9 = 1 + C>(^ 3 / 2 ), 
and so the first two terms in (3.10) contribute something of constant order. On the other hand, 
from the conclusion of the second lemma we have that 



4 ^ g 1, ww* = s g Its + + 0(1) = ^ + 0(1) - 
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which is the advertised result (1.5) for order one regions of the phase. 

If a I with N f oo, we start in the borderline case in which a(N) = /3/VN with a fixed 
(3 > 0. At this scale the quantities in (3.10) go over into approximating Riemann sums. First 
there is, 



t=l J -<* 



N -«w Vn i A 



■ ' \-\e\e~ 6 de + o(i) 



N 



ViV-^ y -j= J ee~ e d6dx + 0[VnY1 J e ! [W/^O - *M*)J dx) + 0(1 



for ^(x) 



1-e" 



^=1 " "Tv" 

-, and that error term is controlled (for whatever 0) as in 



/I [Fp(e/N)-F p (x)]dx<-^=J o \Fj t (x)\dx = 0(-±=). 



i=\ J ~Tv 



(The difference between an integral and its Riemann sum being less than 1/Nx the total 
variation of the integrand.) In a similar way we find that 



N 



a(N) a(N) 



2vr 



a(N) 



N-l 



C t {9)d6 = VN 



1 



27T 27T 7 J-^ y/lT 



■ dOdx 



+ 0(1). 



Combined, the last three displays translate as, 



vTv vTv 



(3-11) 



N- 



7T 



3/2 



r 1 

J 



-/3x 2 



2^ 



1 /»oo 



'/./ 

JO J/3 



dx + (3 I I e~® 2 dOd.r 



+ 0(1) 



:= v / iV^/ arg (/3) + 0(l). 



Next, if viVa(iV) j oo one can clearly apply the same steps with the point of view that 
ft = p(N) t oo. Noting that 

as (3 | oo completes the explanation of that case. Finally, if y/~Na(N) j while maintaining 
Na(N) | oo, things are a bit different. In the range £ < N and \f~N6 = o(l), 

C t {0) = 2v^(l - £6 2 + 0(^# 4 )) (l + oQ)+o(^ 



Substituting the above into (3.10) we find that 

AT-l Q JV-1 a 

t^E/ C e (6)d6-^Y, \0\Ct(8)d9 = Na 



Na (VNa) 



\3-, 



3vr 3 /2 3 07r 3/2 



+ 0(1), 
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which anyway is o(Na), identifying the leading order in this case as N(a/2ir). 

To complete the proof, we need to track the growth of S' N . A bit of good fortune is that 

one may compute 4> a (k) = sm J^ > which leaves us to control 

\S' N (<f> a )\ <d £ E 1^)1 (^^), (3-12) 

N/2<l<N 2N-2£<\k\<2£ 

see (3.9). Simply bounding \Ce(k)\ sin 2 (ka) by a constant will produce the estimate S' N (4> a ) = 
O(logiV) stated for the cases where liminfjvtoo V~Na > 0. 

If however, a = o(l/y/N) we need to, and can, do better. Bring in the additional estimate 
|C^(&)| < C2\fi\k\~ 1 courtesy Lemma 3.1 which has bite for \k\ » t. With this in mind the 
sum is split as follows: with £*(N) ~ N - \y/N the point where 2N -21 = \fi over I > N/2, 

\S' N (4>a)\<[ E + E + E }\Ce(k)\(^^):=A N + B N + C N . 

N/2<i<e*(N) e*(N)<e<N e*(N)<e<N 

2N-2t<\k\<2t ^I<\k\<2l> l<\k\<Vt 

Moving from left to right, and first bounding the summand by V^l&l -3 we have 
A N <c 3 £ ^—^_< C4 ^N £ ji=°(ti> 

N/2<l<N-\^N \VN<t<N/2 

as well as 

*>*<<* E E E ^n = m - 

N-^/N<i<N VI<k<oo N—\/N<£<N 



For Cn we finally use the fact that a [0: as k~ 2 sin 2 (a/e) < 2a 2 for a = o(l/y/N) and k < VN 
we conclude 

< c 8 (^« 2 ) = o(l) 

N-VN<i<N 

as needed. In closing we note that a careful review of the Cn will demonstrate that the stated 
0(log N) error term in the cases where liminfATfoo y/Na > cannot be improved. The proof 
is finished. 

Proof of Theorem 1.7 (angular case) In each on the four cases, the covariance of the 
pair (#0[a,/3], #©[#, 7]) is connected to a 4>k(9) (k = 1,2,3,4) which is either constant in a 
neighborhood of the origin or possesses a corner at that point. In particular, when the intervals 
in question are either disjoint or else [a, ft] sits inside of [S, 7] we have 

1 p-\ j on [ft-S,27r + a- 7 ] when a < ft < 5 < j 

M6) - = 2^L e W e )^={^ on[<5-a, 7 -/?]when5<a</3<7 

The relevant computation is then: for (f)(6) = (f)(0) over — e < 9 < e, 

N ~ 1 r iV ~ 1 r £e rift 

N(f)(0)-J2 / C e (0)md9 = N(f)(O)-mY,\ e^^ + O^ 2 )} =0(1). 
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As for 5^(^1,2), we will estimate fm 2 (0) := {4>i t 2 — D m * 0i,2)(#) (which is evaluated at 
m = 2N — 21 — 2 and then summed over £) near the origin. In particular, we have already seen 
that by adding a constant if necessary we can assume that 4>i,2{@) = in [— e, e], and we want 
to show that f m can be made uniformly small throughout say [— e/2, e/2\. Now, 

fk\0) = X ^Hp^P ^{(m+\)e')de' := j Tgh2 (9',9)sm((m + 1 -)9')de', 

and if \9\ < e/2 then 4>i,2(9 — 9') = for all \9'\ < e/2. As otherwise <pi^ is piece-wise linear, 
the second derivative of g\ ) 2{9') will certainly be integrable away from [—e/2, e/2]. Therefore, 
integrating by parts twice will show that fm 2 (9) = 0{m~ 2 ) throughout \9\ < e/2. This in turn 
will imply that 

(ce * 7^-2) (o) = 0(^4^5) + or 3 / 2 ) 

which is summable over the appropriate range of I: S' N (^>\^) is of constant order. The de- 
correlation at the proposed rate is thus established. 

Taking next the situation that a < /3 = <5 < 7, it is enough to note that there exists an 
e > with 

Ue):= ^Le %^)^={ 27r for *€ [0, £ ], 
whereas if a = 5 and /3 / 7 (take /? < 7 for convenience) we may say that 



2tt 

for some e' > 0. It will then follow from the second half of Lemma 3.1 that 

^ / r.Jfi\rh,(ft\ rift = - , 

2 V 7T 3 



r 1 / yv f 1 / TV 

2Vtt* f^Jj 2Vtt» 

Furthermore, re-tooling that part of the previous proof (for the variance estimates) related to 
the S' N term will show that 5^(^3,4) = O(logiV); the corner at the origin being the cause of 
this increased growth. The proof is finished. 

4 Radial Statistics 

We begin with the proof of Theorem 1.3 which relies on the product formula 

N N 
k=l k=l 



[n/(N)]=n^[/(v^)]- (41 



where you will recall that = rji + ■ ■ ■ + % with the 77's independent exponentials of mean one. 
While related to the covariance formula (1.2), it is easier to see (4.1) as direct consequence of 
(5.1) derived in the next section. 
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Proof of Theorem 1.3 Let h satisfy the assumed boundedness and regularity. For con- 
venience we consider test the function r — > h{r 2 ) with a Gaussian limit for ^2 k= ih(\z k \ 2 ) — 
E[J2k=i h(\zk\ 2 )] proved by computing Laplace transforms. In particular, the formula (4.1) is 
applied with f(r) = exp [Xh(r 2 )] and A a parameter lying in a fixed neighborhood of the origin 
to find that 



E 



Bxp^\[X{h)-EX{h)]^ = E 



exp{\[J2(H\z k \ 2 )-Eh(\z k \ 2 ))}} 

k=l 
N 

= Y[E[exp{\[h(s k /N)-Eh(s k /N)]}]. 



(4.2) 



k=l 



We wish to expand each appearance of h(s k /N) as in h(s k /N) = h{k/N) + hi \k/N){(s k ■ 
k)/N)+\h"{k/N)((s k - k)/N) 2 +R h (s k /N,k/N). Toward this, make the definitions h(s k /N) 
h(s k /N) - E[h(s k /N)}, 

s k - £r 



and 



M N , k (h) :=exp{A/ l '(fc/iV)(^^)}, 



n Ntk (h) := exp { ^h"{k/N)({s k - kf - k) + \R h (s k /N,k/N)} 



in which R h (s k /N,k/N) = R h (s k /N,k/N) - E[R h (s k /N,k/N)], and the right hand side of 
(4.2) may be continued as the product of 



E 



exp[\h(s k /N)] =E M Nyk (h) 



(4.3) 



+E 



{e XHsk/N) -M N , k (h))t { ^ 1+£} 



+ E 



M 



N,k 



(h)(n N , k (h) 



L {l<l+e} 



from k = 1 to N. 

Computing the product of the first term in the right of (4.3) produces the Gaussian struc- 
ture: choosing A so that |A| max re [ 0) i] \h'(r)\ < 1/2, 



N N h 

log H E [M N , k (h)} =~J2 (^'(k/N)- + k log(l 



k=i 



k=l 



Xh/jk/N) 
N 



)) 



N i 2 N h i r 1 

= -J2k(\h'(k/N)j-) +0(Y,^)=-^ 2 (h'(r)) 2 rdr + o(l). 
k=i k=i J ° 

Now similar estimates to those used in the last display will show that E M.N,k{h) is bounded 
above and below by positive constants independent of k or N, and so the proof is completed 
by establishing that 



v 



J2{E[(e xh ^ N ^ + 



k=i 
as N | oo. 



+ E 



M N , k {h){K N , k (h) - l)l { fi< 1+£} ] } 
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For the first part, let c\ = sup < r<oo h(r) and write 



E 



(e xh ^+M N , k (h))l a 



>l+e} 



< e 2|A|cip^ > 1 + £ j +E 



iV 

iV \fe/2 



e 1 {l>l+4 



in which the last line is easily bounded by a constant multiple of e~^ N for 7 > depending 
on e but not on k. For the remaining part, note first that on {sk/N < 1 + e} we have that 
■M.N,k(h) < exp (1 + e) and also that the exponent in 7?.jv,fc(M m ay be bounded uniformly in 
k and N. Since \e a — 1| < |a|e' a ' we have that 



E 



M N;k (h)\K N;k {h) - l|l { fi<i 



(4.4) 



fc\2 



Vk 



+ c 3 E 



\R h (s k /N,k/N)\ 



with constants c 2 and c 3 . Clearly E[\(^^) 



] = o(l) for | 00. It follows that the first 
term on the right of (4.4), summed from k = 1 to N, is also o(l) as AT j 00. For the final term 
in (4.4), by the regularity of h we may assume that \Rh{s k /N,k/N)\ < c± \ Sk ~ k \ 2+a (having 
applied this estimate while the restriction {s k /N < 1 + e} was still in place). The upshot is 
that 



N N 

£ E [\R h (s k /N, k/N)\] < E h " fc|2+Q 



fc=i 



k=i 



JV 



'7V2+' 



2 + a 
2 ~ 



fc=l 



as desired. Here, the second inequality may be arrived at rather directly in the present setting 
of exponential random variables. More generally, an inequality due to Marcinkiewicz and 
Zygmund (which may be found in [8]) will explain why E[\ Yle=i x (\ P ] — ^ 2 for p > 2 and 
{xf\ independent copies of a mean-zero x\ for which < 00. 

Finally, the statement concerning the asymptotic behavior of the covariance may be gleaned 
from the above by setting h = (5f + jg for parameters and 7. Having controlled moment 
generating functions, we have adequate tightness to draw conclusions on the limiting covariance 
as well as higher moments. The proof is finished. 



Moving on to the covariance of the number statistic #X[-,-], the basic error estimate re- 
quired has in fact already been done for us. It is embodied in the classical Edgeworth expansion 
providing corrections to the local central limit theorem. The statement is: with p(a, M) the 



density of the random variable -t= YldLiiVt ~ 1) a * 



M 



sup 

— oo<a<oo 



p(a,M) - 



-a 2 /2 _ 



27T 



M 



OiM- 1 ), 



(4.5) 



see for example [5], Corollary 19.4. 
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Proof of Theorem 1.6 Start with a fixed interval [a, b] (0 < a < b < 1) of the moduli for 
N | oo. Denote by pN,k(A) = P(j^s k G A). The variance of #I[a, b] then reads 

N N 

Va rj v(#J[a,6]) = ^p N , k ([a 2 , b 2 }) (l - p N , k {[a\b 2 ])) := ^Pn,^ ,b 2 ])p N , k {[a 2 ,b 2 ] c ), 



k=i 



k=l 



(4-6) 

and the (classical) central limit theorem explains why it should be only 0(\Z~N) neighborhoods 
of k = Na 2 and k = Nb 2 that contribute to the sum on the right. Breaking up the sum in 
accordance with that observation, we write: 



Vai w (#Z[o,i>] 



E + E + E' 

Jfc-jVa 2 |<<5jvA/iV \k-Nb 2 \<S N VN 



PN,k([a\b 2 ])p N , k ([a 2 ,b 2 f). 



Here J2' nas m general three components (k G [0, Na 2 - 5 N \^N] U[Na 2 + 5 N \^N, Nb 2 -5 n VN] 
U[iV6 2 + 5nVN, N]), and 5 n is some rate chosen so that this sum is negligible for N f oo. We 
spell out the estimate for the first component, the rest being nearly identical. 

With {Afc} a sequence of positive constants less than one to be determined, we have: 



^2 PNM[a 2 ,b 2 ]) PN , k ([a 2 ,b 2 ] C )< Yl P (s k >Na 2 
k<Na 2 -5 N y/N k<Na 2 -S N VN 

k 



(4.7) 



< ^2 exp 

S N y/N<k<Na 2 

< e~ k2 ' 2Na2 < Ci VnJ 
5 N VW<k<Na 2 



Jv(a 2 A t + (o 2 --)log(l-A t )) 



S N 



in 



The second inequality invokes Chernov's bound plus the change of variables k — ► [A^a 2 J — k. 
The third follows from the choice of = jf-j, and the fourth from a Taylor expansion. The 
conclusion of the last line is that letting 5n = ^logN implies Yl PN,kO- ~ PN,k) = "(1). 

Turning to the sums of consequence (over \k - Na 2 \ < 5 N VN and \k - Nb 2 \ < 5 N VN), 
it is evident that they will have equal contributions. Consider the first sum and rewrite the 
typical appearance of PN,k([a 2 , b 2 ]) as: with k = Na 2 + £ and t ranging between ±5nVN, 



P^s Na2+e G [a 2 ,6 2 ]) 



(4.8) 



'( 



i) > 



+ e N (t,a))-0(e- c ^ N ). 



Here 



\e N (,£,a)\ < 



VWa? VNa 2 + 1 



+ (Na 2 + t - [Na 2 + l\) < Cl -^= + c 2 

Vn^ ' Vn 



'TV 3 / 2 ' 



and the (exponentially small) error term stems from a standard large deviation estimate. 
Applying (4.5), we may then continue (4.8) as in 



PN,Na 2 +i([a ,b ]) 



(4.9) 
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■/Na s/Na 

The first equality is achieved by first cutting down the integration to |c| < logiV and then 
applying the Edgeworth expansion. The error produced by this process of limiting the range 
of integration and later restoring it up to +00 may be controlled by the same large deviation 
procedure used already twice before and the Gaussian estimate J°° e~ x l 2 dx < a~ 1 e~ a I 2 . The 
second equality is self-explanatory. 

The leading order of the variance is then read off by subtracting the square of (4.9) and 
summing in I: 

E PN,Na^ + e([a 2 ,b 2 ])(l-p NtNa 2 +e ([a 2 ,b 2 })) (4.10) 

-y/N8 N <t<y/N6 N 

= ^ V -L/f e^-[f e^l 2 ^=] 2 } + £ l{ N) 

-VN5 N <£<VN5 n VWa -jNa 

a a 

The error involved in taking the sum over to the (full) integral in the second line is easily 
controlled: 



poo poo r — = px' 

£2{N) < c 3 ^N / e- x2/2 dxdx' + c 4 ^N i t e ~ X2/2dx dx ' ( 411 ) 

5n x ' \£\<Vns n ~7n 

\e\<VNS N 



It remains to show that £\(N) = 0(1). Terms three and four in (4.9) and their squares, 
summed over the range ±y/N\ogN clearly remain finite as N f 00. As for the second term, 
note that it is already of the form iV -1 / 2 times an integrable function of IN" 1 ! 2 . That is, it is 
already properly normalized so that the corresponding sums go over into (convergent) integrals 
as N I 00: an instance of this being, 



L ^<c 7 l £ (l + ^)e-^<c 8 [ (1 + xV^ 

(4.12) 

after an integration by parts. Finally, the integral in line three of (4.10) is evaluated as = 

and adding the corresponding term for k ~ Nb 2 completes the proof of (1.7). 

The proof for mesoscopic scales (1.8) builds on the above. Let us set, y/N(b — a) = c, and 
consider first the cases that c is fixed (the critical case) or c f 00 with N. Also, we take the 
interval in question to be [a, a + c/y/N] for a fixed a. While one may also consider symmetric 
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intervals shrinking to a at N = oo, the proof is more in lines with the above if we make the 
present convention. In particular, retracing those arguments through the first line of (4.10), it 
is evident that 



Va riV (#J 



a, a- 



iV-l 



2C+ VNa _ x 2 j 2 dx 



\£\<Vns 



e 



2vr 



2C+ VNa _ x 2 12 dx 



l }+£i(c,N). 



The error £±(c, N) plays the same role as £i(N) in (4.10) and is amenable to the same argu- 
ments. While certain aspects such as the estimate in (4.12) are now more elaborate due to the 
contribution of the upper limit in the integral, everything goes through just as easily if one 
keeps in mind that c is at most o(y/N). Next the outer sum is replaced by an integral after an 
estimate nearly identical to (4.11). That is, the previous display may be continued, and the 
variance given by 



Na 



x Vvr 



7T 



2c+x' j 
2/ 2 ax 



2c+x' 



-x 2 /2_ 



=\ jdx' -.= (l—y mod ( c ) (4.13) 



up to order one errors. Recall that this holds for either c fixed, completing the proof in that 
case, or c = c{N) | oo. In the latter setting, one may easily check that I mo d(c) increases to its 
value 1 = I mo d{co) which explains the statement at those scales. 

We finish with the case that c [ 0. Here the Edgeworth expansion is used slightly differently 
and the growth of the variance is accounted for by 
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(4.14) 



as may be understood upon further review of (4.10). In this expression it is only the first sum 
which plays a role in the asymptotics. Integrating by parts in that term we find that 
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e 2(x+ JtJ 2 dx^j 



(1 + o(l)) + 0(Nc 



N(2ac)(l + o(l)) = N(b 2 - a 2 )(l + o(l)) 



after substituting back c = v A^(6 — a). The same computation shows that the second, or sum 
of squares, term in (4.14) is indeed of lower order. The proof is finished. 



Proof of Theorem 1.7 (radial case) Whatever the relation among a,b,c and d, 



Cov N (#l[a,b],#l[c,d]) = ^ [pjv,fc([a, b] n [c,d]) - p Ntk ([a,b])p Ntk ([c,d]) 



k=i 
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and the estimates obtained in the course of the previous proof may be revisited. Beginning 
with the situation [a 2 ,6 2 ] n [c 2 ,d 2 ] = 0, the covariance reads as 

N 

Cov N (#l[a,b},#l[c,d}) = -Y,PNA[a\b 2 \)p N)k {[c 2 ,d 2 }). (4.15) 

k=l 

Taking for definiteness b < c, this sum may be bounded in a simple way as in 

N 

^2PN,k([a 2 ,b 2 }) PN , k ([c 2 ,d 2 })< Yl P(s k >Nc 2 )+ Y P(s k <Nb 2 ) 

fc=l l<k<N(c 2 +b 2 )/2 N(c 2 +b 2 )/2<k<N 

with the first term on the right (being indicative of either) subject to 

N(c 2 +b 2 )/2 N(c 2 +b 2 )/2 2 

Y P{su>Nc 2 )<e- Nc2 (rp) e k < Cl Ne- N ^- d+c! ^\ 

k=l k=l 

withe' = (c 2 +b 2 )/2c 2 < 1, ensuring that 1— c'+e'loge' > 0. This second bound just maximizes 
the summands, while the first is the usual trick of optimizing in Chebychev's inequality (as used 
in (4.7)). That the right hand side of (4.15) vanishes exponentially fast in TV is established. 
The case when [a 2 ,6 2 ] C [c 2 ,d 2 ] follows in kind. Then 

N 

Cov N (#J[a, b],#I[c, d\)= 5>7V,fc([a 2 , b 2 ]) (1 - p N , k ([c 2 , d 2 ])), (4.16) 

k=i 

but since now [a, b] and [c, d] c are disjoint this expression is really quite the same as (4.15) 
and thus is also exponentially small (though positive) as N f oo. 

The last two cases in which the intervals share a boundary point, either a < b = c < d or 
a = c while say b < d, also go hand in hand. In the former we have: with 5n = \/log N, 

N 

Cov^(#J[a,fe],#T[6,d])= -^ mfc ([a 2 ,6 2 ]) mfc ([6 2 ,rf 2 ]) (4.17) 

k=i 

PN,k{[Q,b 2 })p N)k {[b 2 \ oo)) 

\k-Nb 2 \<S N VN \k-Nb 2 \>S N VN 

Here the choice of Sn and the last line are just reprises of the proof of Theorem 1.6. Finally, 
if a = c the covariance is 

N N 

Co Vjv (#J[a, b], #I[a, d]) = J>jv,fc([0, a 2 \)p N)k {[a 2 , b 2 ]) + ^ mfc ([ a 2 , b 2 ])p Ntk ([d 2 , oo]), 

fc=i fc=i 

where we see that the first sum is of similar type to that in the first line of (4.17) and so 
produces the claimed §\/TV + o(l) as N f oo. The second term does not affect this outcome; 
having the same form as either (4.15) or (4.16) it is exponentially small. The proof is finished. 
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5 Extensions to the Quaternion Ensemble 



Replacing the complex Gaussian entries in the definition of P/v with real quaternion Gaussians 
produces another solvable ensemble first by studied Ginibre. In this case the eigenvalue 
density has the form 

dP®{zi, ■ ■ ■ zn) = —q [J \ze - z k \ 2 \z e - z k \ 2 ]j \z k - z k \ 2 dp, N (zi) ■ ■ ■ dfi N (z N ) 

Z N l<t<k<N l<k<N 

where now djl(z) = e~ 2N \ z \ 2 d^t(z)dSs(z) is the appropriately scaled weight to keep the limiting 
density of states uniform on \z\ < 1. The introduction of the additional differences-squared 
terms breaks the rotation symmetry of P/v. It also breaks some of its analytic structure. While 
there again exist explicit formulas for the finite dimensional marginals or correlation functions, 
these are now expressed in terms of quaternion determinants or pfaffians. That is, P® is not 
a determinantal point field. Still, part of the previous discussion extends readily to this model 
for the reason that the radial components of the eigenvalues under P® have a nearly identical 
statistical description to those in the P/v ensemble. 

The fact is that in either P/v or P® one may integrate out the angles from the start, 
producing a full joint density on the moduli {r k = \z k \,k = 1,...,N} (see [13] for related 
remarks). The resulting densities possess their own structure, and for certain analysis it is 
not important whether or not they derived from a determinantal setup. We start with P^r. 
Returning to the original expression (1.1), we simply expand the Vandermonde determinants 
to find that: with a and tp permutations on N letters and k = &rg(z k ), 

dP N (n,---,r N ) (5.1) 

1 c N r •> N 

= ^E S ^) S ^(V0{II / zl k ~ l ^ k - l de k }*X{e~ Nr tr k dr k 
N a,ip fc=l"' _7r k=l 

= mJ^Pn ( r<J i "> p( N ( r °2 )---P { n ] ( r °N )*■!■■■ dr N 

in which p^(r) = r 2t ~ 1 e~~ Nr2 , normalized to be a probability density on r > 0. For P® use is 
made of the identity 



det 



zt 1 Si*- 1 



\\ W - z k \ 2 \z£ - z k \ 2 J J {z k - z k ). 



Kk<2N,KKN 
~ ~ Ki<k<N Kk<N 



Expanding the left hand side, multiplying by \\{z k — Zk)e N \ Zk \ 2 and integrating over the {9k} 
variables produces: with ■0 now a permutation on 2N letters, 



N N 



dPg( ri , ■■■,tn) = ^qJ2 s ^(V0 f[ { f 4 2k ~ 1 ~ 1 ^ 2k -\z k - z- k )de k } x n e~ 2Nr lr k dr k . 

Z N V k=l k=l 

Noting that only terms with \tp2k-i ~ ^2ifc| = 1 f° r a H k contribute, we continue the computation 
as in 

dPftfa, • • • ,rjv) = ^2p ( 2n ] (r ai )p^ ] (r a2 ) • ■ • (r aN ) dn ■ ■ ■ dr N . (5.2) 

a 
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Here we are back to an N th order permutation in a and the p M have precisely the same 
definition as in their appearance in (5.1). For background on this and related computations 
we refer to [24], Appendix 18. 

Certainly the covariance formula (1.2) may be derived directly from (5.1), circumventing 
the need for computing correlations. It follows from (5.2) that the covariances of general linear 
statistics on the moduli in the P® ensemble will satisfy the same formula, so long as each 
appearance of -^sg is replaced with ^S2g- The asymptotics follow suit with identical proofs: 

Corollary 5.1 Under the covariances of linear statistics of the radial components have 
the identical N f oo asymptotic behavior to their counterparts in the Pjv ensemble described in 
Theorem 1.6 and the latter half of Theorem 1.7. 

Further, it should be clear that, by repeating the proof of Theorem 1.3, we may obtain 
Gaussian fluctuations for smooth radial linear statistics in P®. More interesting, for indicator 
type test function, while we may no longer quote the Costin-Lebowitz result directly, we do 
have the following. 

Theorem 5.1 The properly normalized radial number statistics #T[a, b] under P® also satisfy 
a central limit theorem as long as N(b — a) f oo as N j oo. 

The proof of Theorem 5.1 is actually a slight reworking of Costin-Lebowitz; for that reason 
it is postponed until the next section. 

Remark While given by a permanent, point processes of type (5.1) or (5.2) are not the bosonic 
fields appearing in the mathematical physics literature (see [10] and references therein). They 
are in fact simpler objects; it would be be interesting to know if something of their kind arose 
in other contexts. 



6 Appendix 

For completeness we include the statement and a sketch of the proof of the Costin and Lebowitz 
theorem. In keeping with the setting and notation used above, we say a measure P/v on is 
a determinantal point field if all ^-dimensional correlation functions p^,g may be written as 

N\ r i 

p N ,e(zi,z 2 , ■ ■ ■ , zg) = j^- — Try det K N (zi, zj) Pn{oIzi) ■ ■ ■ t^Nidzg) 



{N-e)\ 



i<i,j<e 



(compare (2.2)) in which Kjy is a Hermitian kernel (K^(z,w) = Kn(w,z)) satisfying ||i^v|| < 
1 as an operator on L 2 (C, p^) for some measure /ijy. 

Theorem 6.1 (O. Costin, J. Lebowitz) Let Pn be a determinantal random point field 
and let f^A equal the number of points lying in A C C under that measure. As long as 
Var N [#A] T oo with N | oo then (#A - E[#A])/y/Var N [#A] converges in distribution to a 
Gaussian random variable of mean zero and variance one as N j oo. 
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Proof Following [28] as well as the original [7], the proof checks that the cumulants Cf(#A) 
fall into line as N gets large. Recall that for any random variable X its i-th. cumulant Ce(X) 
is defined through 

OO / -,\/> 

log£[e^]=£Q(X)^ 

l=i 

and that X is Gaussian if and only if C e (X) = for all £ > 2 and C 2 (X) > (see [22]). 

Next define An = K^Ia, the restriction of Kn to L 2 (A,pn) and notice that (compare 
2.4) E N [#A] = C^(#A) = Trace(yljv) and Yar N [#A] = (#A) = Traced - A%). For 
the higher cumulants, there is the following recursion: 

e-2 

Cf{#A) = (-if (I- 1)! Traced - A N ) + J> M Cf (#A). (6.1) 

k=2 

in which {a k e} are certain combinatorial factors. The origins of (6.1) lie in the connection 
between the cumulants and the so-called cluster (or Ursell) functions. The latter are defined 
in terms of the correlation functions. In particular, with UN,k denoting the Appoint cluster 
function, 

u N ,k{z 1 ,z 2 , ■■-,z k ) = ^{-l) k ~ m {m - 1)! p Nj{Sll ■ ■ ■ p Nj \s m \ (6-2) 

where the sum is over m > 1 and, for each m, all partitions of {1, 2, • • • , k} into (nonempty) 
subsets Si etc. (also, P\s 1 \,n indicated the |5i|-point function in the variables corresponding 
to that partition). Defining also 

U k N (A) = ■■ u N)k (zi,Z2,...,Zk)dxidyi---dx k dyk, (6.3) 
J A J A 

the advertised connection is provided by the identity 

OO OO 

k=l ' k=l 

This last display implies that 

e-2 

Cf{#A) = J2 <*kiC%{#A) + (-l) e (£ - 1)! C?{#A) + U?(A). (6.4) 

k=2 

And finally, the fact is that, for determinantal point fields, Uf(A) = (— 1)^ _1 (.£— 1)! Trace(^y), 
explaining (6.1). 

Now note the simple estimate 

< Traced - A N ) <{£- l)Trace(,4jv - A%) (6.5) 

which may be checked by writing Trace(A/v — A N ) = Ylj=\ Trace (A^ N — A^ 1 ) < Ylj=i 
Hvl-j^ 1 ]] Trace(^Ar — A 2 N ) and recalling that ||.f£jv|| < 1- Substituting (6.5) into (6.1) we have 



24 



that Cf (#A) = 0(C£{#A)) for all I > 2. And since C£(#A) | oo as iV | oo, we also have 
that 

N ( #A-E N [#AU C?{#A) 
1 V y/Var N [#A] > ~" 
for £ > 2. The proof is finished. 

Proof of Theorem 5.1 Consider more generally a set of probability densities {pN,k} (k = 
1,2,... N) on C and the point process defined for each N by joint density 

P N ( Zl ,...,Z N ) = PN,*(l)(zi)PN,a(2)(z2)---PN,*(N)(zN)- 
permutations a 

We show that the number statistic for such an ensemble satisfies a central limit theorem if its 
variance grows without bound as N f oo. This may then by applied to the particular case of 
(5.2) in which a choice of PN,k has been made (= p^ff m that notation), things are restricted 
to the positive reals rather than all of C, and the needed growth of the variance has already 
been checked. 

This proof also rests on the behavior of the cumulants. While not determinantal, the 
correlation functions now have the simpler form: 

P£,n{zi,- ■ ■ ,ze) = ^ PN,ki( z ^)PN,k 2 ( z 2)---PN,k e ( z e)- 

l<ki,k 2 ,-ke<N 

So, returning to the Costin-Lebowitz proof, we make use not of (6.1), but instead the prelimi- 
nary cumulant formula (6.4). With as in the previous display, a calculation in (6.3) and 
(6.2) shows that 

N 

u e N (A) = (-iy-\£-iy.j2\pNAA)] e 

k=l 

in the present case. That is, the growth of the I th cumulant is now tied to Ylk=i(PN,k{A) — 
p l N k (A)). But, p — p e < (£ — l)(p — p 2 ) for t > 2 and any p G [0, 1], which is to say that I th 
cumulant is dominated by a constant multiple of the second. The proof is finished. 

Acknowledgments A serious debt of gratitude is due to the referee. In addition to helping 
correct various small glitches, their thoroughness identified some important errors in an earlier 
version of this paper. 
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